The signal reconstruction of speech by KPCA

نویسندگان

  • Hui Yan
  • Xuegong Zhang
  • Yanda Li
  • Li Qin Shen
  • Weibin Zhu
چکیده

A new method for speech signal reconstruction is proposed by performing a nonlinear Kernel Principal Component Analysis (KPCA). By the use of kernel functions, one can efficiently compute principal components in high-dimensional feature spaces, and reconstruct vectors mapping from input space by those dominant principal components. As the reconstructed vectors is expressed in high dimensional feature space and they could not exist pre-image in input space. For finding pre-image, we use iteration method to approximate the pre-image. The experimental results using KPCA in data reconstruction and denoising in speech signal show that it had many potential advantages comparing with PCA. 1.PRINCIPLE Principal Component Analysis (PCA) is an orthogonal basis transformation. The new basis is founded by diagonalizing the centered covariance matrix of a data set, The coordinates in the Eigenvector basis are called principal components. The size of an Eigenvalue corresponding to an Eigenvector v of covariance matrix equals the amount of variance in the direction of v. Furthermore, the directions of the first n Eigenvectors corresponding to the biggest n Eigenvalues cover as much variance as possible by n orthogonal directions. In many applications they contain the most interesting information: for instance, in data compression, where we project onto the directions with biggest variance to retain as much information as possible, or in de-noising, where we deliberately drop directions with small variance. Assume that our data is mapped into feature space by nonlinear map ) ( : x x Φ > − Φ , ) ( , ), ( ), ( 2 1 l x x x Φ Φ Φ Λ , (1) And do PCA for the covariance matrix. ∑ = Φ Φ = l i T l 1 ) ( ) ( 1 i i x x C (2) We have to find Eigenvalues and Eigenvectors satisfying V C �V = . (3) This implies that we may consider the equivalent system. l j C i , , 1 ), ( ), ( Λ = > Φ >=< Φ < i j i j v x v x λ . (4) By defining > Φ Φ =< ) ( ), ( j i x x ij K , we get the expression

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...

متن کامل

Kernel PCA for Speech Enhancement

In this paper, we apply kernel principal component analysis (kPCA), which has been successfully used for image denoising, to speech enhancement. In contrast to other enhancement methods which are based on the magnitude spectrum, we rather apply kPCA to complex spectral data. This is facilitated by Gaussian kernels. In the experiments, we show good noise reduction with few artifacts for noise co...

متن کامل

Speech Enhancement using Adaptive Data-Based Dictionary Learning

In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000